3574 results found.
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons License: Attribution 4.0 International
Size:
10 GByte
Production Status:
Existing-used
Use:
Machine Learning
- Paper title: Speech Denoising With Deep Feature Losses
- Paper track: 6.4 Speech enhancement: single-channel/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Francois Germain | Noisy speech database for training speech enhancement algorithms and TTS models | /N |
Documentation:
Yes/English/Yes
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From LDC
License:
LDC (with amendments, see site for details)
Size:
180 hours
Production Status:
Newly created-finished
Use:
Speech Recognition/Understanding
- Paper title: Challenging the Boundaries of Speech Recognition: The MALACH Corpus
- Paper track: 12.6 Speech and multimodal resources/Oral Presentation
- Paper status: Accept - Oral
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Michael Picheny | USC-SFI MALACH Interviews and Transcripts English - Speech Recognition Edition | /N |
Documentation:
https://catalog.ldc.upenn.edu/docs/LDC2019S11/README.pdf
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
2 GByte
Production Status:
Existing-used
Use:
Evaluation/Validation
- Paper title: End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
- Paper track: 8.8 Acoustic model adaptation (e.g. bandwidth, emo/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Emiru Tsunoo | Speech Commands | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
None
Production Status:
Existing-used
Use:
Machine Learning
- Paper title: End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
- Paper track: 8.8 Acoustic model adaptation (e.g. bandwidth, emo/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Emiru Tsunoo | Wall Street Journal (WSJ) Corpus | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
None
Production Status:
Existing-used
Use:
Machine Learning
- Paper title: End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
- Paper track: 8.8 Acoustic model adaptation (e.g. bandwidth, emo/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Emiru Tsunoo | TIMIT Acoustic-Phonetic Continuous Speech Corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Not Available
License:
proprietary
Size:
~2300 hours
Production Status:
Existing-used
Use:
spoken language proficiency assessment
- Paper title: Automatic Detection of Off-topic Spoken Responses Using Very Deep Convolutional Neural Networks
- Paper track: 12.14 Evaluation of spoken language technology/Poster Presentation
- Paper status: Accept - Poster
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Xinhao Wang | ETS_Speech_OffTopic | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
BSD-style open software license
Size:
1131 prompts
Production Status:
Existing-used
Use:
Speech Synthesis
- Paper title: Semi-supervised voice conversion with amortized variational inference
- Paper track: 8.6 Neural network training methods (including new/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Cory Stephenson | CMU ARCTIC database | /N |
Documentation:
See documentation: http://www.festvox.org/cmu_arctic/cmu_arctic_report.pdf
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons BY-NC-SA 4.0
Size:
1.3 MByte
Production Status:
Newly created-in progress
Use:
Pronunciation Scoring
- Paper title: EpaDB: a database for development of pronunciation assessment systems
- Paper track: 12.19 Other topics in Spoken Language Processing: /Poster Presentation
- Paper status: Accept - Poster
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | Jazmín Vidal | EpaDB | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
40 GByte
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: NIESR: Nuisance Invariant End-to-end Speech Recognition
- Paper track: 8.3 Robustness against noise or reverberation/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Resource Name | Country |
|---|---|---|---|
| Main Contact | I-Hung Hsu | CHiME3 | /N |
Documentation:
None




